59 research outputs found

    Building and Using Geospatial Ontology in the BioCaster Surveillance System

    Get PDF

    BioCaster: detecting public health rumors with a Web-based text mining system

    Get PDF
    Summary: BioCaster is an ontology-based text mining system for detecting and tracking the distribution of infectious disease outbreaks from linguistic signals on the Web. The system continuously analyzes documents reported from over 1700 RSS feeds, classifies them for topical relevance and plots them onto a Google map using geocoded information. The background knowledge for bridging the gap between Layman's terms and formal-coding systems is contained in the freely available BioCaster ontology which includes information in eight languages focused on the epidemiological role of pathogens as well as geographical locations with their latitudes/longitudes. The system consists of four main stages: topic classification, named entity recognition (NER), disease/location detection and event recognition. Higher order event analysis is used to detect more precisely specified warning signals that can then be notified to registered users via email alerts. Evaluation of the system for topic recognition and entity identification is conducted on a gold standard corpus of annotated news articles

    Information Retrieval Systems Adapted to the Biomedical Domain

    Get PDF
    The terminology used in Biomedicine shows lexical peculiarities that have required the elaboration of terminological resources and information retrieval systems with specific functionalities. The main characteristics are the high rates of synonymy and homonymy, due to phenomena such as the proliferation of polysemic acronyms and their interaction with common language. Information retrieval systems in the biomedical domain use techniques oriented to the treatment of these lexical peculiarities. In this paper we review some of the techniques used in this domain, such as the application of Natural Language Processing (BioNLP), the incorporation of lexical-semantic resources, and the application of Named Entity Recognition (BioNER). Finally, we present the evaluation methods adopted to assess the suitability of these techniques for retrieving biomedical resources.Comment: 6 pages, 4 table

    Structuring an event ontology for disease outbreak detection

    Get PDF
    <p>Abstract</p> <p>Background</p> <p>This paper describes the design of an event ontology being developed for application in the machine understanding of infectious disease-related events reported in natural language text. This event ontology is designed to support timely detection of disease outbreaks and rapid judgment of their alerting status by 1) bridging a gap between layman's language used in disease outbreak reports and public health experts' deep knowledge, and 2) making multi-lingual information available.</p> <p>Construction and content</p> <p>This event ontology integrates a model of experts' knowledge for disease surveillance, and at the same time sets of linguistic expressions which denote disease-related events, and formal definitions of events. In this ontology, rather general event classes, which are suitable for application to language-oriented tasks such as recognition of event expressions, are placed on the upper-level, and more specific events of the experts' interest are in the lower level. Each class is related to other classes which represent participants of events, and linked with multi-lingual synonym sets and axioms.</p> <p>Conclusions</p> <p>We consider that the design of the event ontology and the methodology introduced in this paper are applicable to other domains which require integration of natural language information and machine support for experts to assess them. The first version of the ontology, with about 40 concepts, will be available in March 2008.</p

    INSGFP/w human embryonic stem cells facilitate isolation of in vitro derived insulin-producing cells

    Get PDF
    AIMS/HYPOTHESIS: We aimed to generate human embryonic stem cell (hESC) reporter lines that would facilitate the characterisation of insulin-producing (INS⁺) cells derived in vitro. METHODS: Homologous recombination was used to insert sequences encoding green fluorescent protein (GFP) into the INS locus, to create reporter cell lines enabling the prospective isolation of viable INS⁺ cells. RESULTS: Differentiation of INS(GFP/w) hESCs using published protocols demonstrated that all GFP⁺ cells co-produced insulin, confirming the fidelity of the reporter gene. INS-GFP⁺ cells often co-produced glucagon and somatostatin, confirming conclusions from previous studies that early hESC-derived insulin-producing cells were polyhormonal. INS(GFP/w) hESCs were used to develop a 96-well format spin embryoid body (EB) differentiation protocol that used the recombinant protein-based, fully defined medium, APEL. Like INS-GFP⁺ cells generated with other methods, those derived using the spin EB protocol expressed a suite of pancreatic-related transcription factor genes including ISL1, PAX6 and NKX2.2. However, in contrast with previous methods, the spin EB protocol yielded INS-GFP⁺ cells that also co-expressed the beta cell transcription factor gene, NKX6.1, and comprised a substantial proportion of monohormonal INS⁺ cells. CONCLUSIONS/INTERPRETATION: INS(GFP/w) hESCs are a valuable tool for investigating the nature of early INS⁺ progenitors in beta cell ontogeny and will facilitate the development of novel protocols for generating INS⁺ cells from differentiating hESCs

    Search for post-merger gravitational waves from the remnant of the binary neutron star merger GW170817

    No full text
    In Advanced LIGO, detection and astrophysical source parameter estimation of the binary black hole merger GW150914 requires a calibrated estimate of the gravitational-wave strain sensed by the detectors. Producing an estimate from each detector's differential arm length control loop readout signals requires applying time domain filters, which are designed from a frequency domain model of the detector's gravitational-wave response. The gravitational-wave response model is determined by the detector's opto-mechanical response and the properties of its feedback control system. The measurements used to validate the model and characterize its uncertainty are derived primarily from a dedicated photon radiation pressure actuator, with cross-checks provided by optical and radio frequency references. We describe how the gravitational-wave readout signal is calibrated into equivalent gravitational-wave-induced strain and how the statistical uncertainties and systematic errors are assessed. Detector data collected over 38 calendar days, from September 12 to October 20, 2015, contain the event GW150914 and approximately 16 of coincident data used to estimate the event false alarm probability. The calibration uncertainty is less than 10% in magnitude and 10 degrees in phase across the relevant frequency band 20 Hz to 1 kHz

    First narrow-band search for continuous gravitational waves from known pulsars in advanced detector data

    Get PDF
    In Advanced LIGO, detection and astrophysical source parameter estimation of the binary black hole merger GW150914 requires a calibrated estimate of the gravitational-wave strain sensed by the detectors. Producing an estimate from each detector's differential arm length control loop readout signals requires applying time domain filters, which are designed from a frequency domain model of the detector's gravitational-wave response. The gravitational-wave response model is determined by the detector's opto-mechanical response and the properties of its feedback control system. The measurements used to validate the model and characterize its uncertainty are derived primarily from a dedicated photon radiation pressure actuator, with cross-checks provided by optical and radio frequency references. We describe how the gravitational-wave readout signal is calibrated into equivalent gravitational-wave-induced strain and how the statistical uncertainties and systematic errors are assessed. Detector data collected over 38 calendar days, from September 12 to October 20, 2015, contain the event GW150914 and approximately 16 of coincident data used to estimate the event false alarm probability. The calibration uncertainty is less than 10% in magnitude and 10 degrees in phase across the relevant frequency band 20 Hz to 1 kHz

    A framework for enhancing spatial and temporal granularity in report-based health surveillance systems

    No full text
    Abstract Background Current public concern over the spread of infectious diseases has underscored the importance of health surveillance systems for the speedy detection of disease outbreaks. Several international report-based monitoring systems have been developed, including GPHIN, Argus, HealthMap, and BioCaster. A vital feature of these report-based systems is the geo-temporal encoding of outbreak-related textual data. Until now, automated systems have tended to use an ad-hoc strategy for processing geo-temporal information, normally involving the detection of locations that match pre-determined criteria, and the use of document publication dates as a proxy for disease event dates. Although these strategies appear to be effective enough for reporting events at the country and province levels, they may be less effective at discovering geo-temporal information at more detailed levels of granularity. In order to improve the capabilities of current Web-based health surveillance systems, we introduce the design for a novel scheme called spatiotemporal zoning. Method The proposed scheme classifies news articles into zones according to the spatiotemporal characteristics of their content. In order to study the reliability of the annotation scheme, we analyzed the inter-annotator agreements on a group of human annotators for over 1000 reported events. Qualitative and quantitative evaluation is made on the results including the kappa and percentage agreement. Results The reliability evaluation of our scheme yielded very promising inter-annotator agreement, more than a 0.9 kappa and a 0.9 percentage agreement for event type annotation and temporal attributes annotation, respectively, with a slight degradation for the spatial attribute. However, for events indicating an outbreak situation, the annotators usually had inter-annotator agreements with the lowest granularity location. Conclusions We developed and evaluated a novel spatiotemporal zoning annotation scheme. The results of the scheme evaluation indicate that our annotated corpus and the proposed annotation scheme are reliable and could be effectively used for developing an automatic system. Given the current advances in natural language processing techniques, including the availability of language resources and tools, we believe that a reliable automatic spatiotemporal zoning system can be achieved. In the next stage of this work, we plan to develop an automatic zoning system and evaluate its usability within an operational health surveillance system.</p
    corecore